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Abstract — In a cognitive radio setting, secondary users op- 
portunistically access the spectrum allocated to primary users. 
Finding the optimal sensing and transmission durations for 
the secondary users becomes crucial in order to maximize the 
secondary throughput while protecting the primary users from 
interference and service disruption. In this paper an adaptive 
sensing and transmission scheme for cognitive radios is proposed. 
We consider a channel allocated to a primary user which operates 
in an unslotted manner switching activity at random times. 
A secondary transmitter adapts its sensing and transmission 
durations according to its belief regarding the primary user 
state of activity. The objective is to maximize a secondary 
utility function. This function has a penalty term for colli- 
sions with primary transmission. It accounts for the reliability- 
throughput tradeoff by explicitly incorporating the impact of 
sensing duration on secondary throughput and primary activity 
detection reliability. It also accounts for throughput reduction 
that results from data overhead. Numerical simulations of the 
system performance demonstrate the effectiveness of adaptive 
sensing and transmission scheme over non-adaptive approach in 
increasing the secondary user utility.^ 

I. Introduction 

In cognitive radio networks, secondary or unlicensed users 
are allowed to share the spectrum with primary, licensed users. 
Unlike primary users who can access the spectrum at will 
at any time, secondary users have to search for the vacant 
slots in the spectrum and opportunistically access the spectrum 
without causing interference to the primary users. Secondary 
users carry out spectrum sensing to detect the state of the 
primary users. When there is no primary activity over a certain 
band, the secondary users can utilize the band to transmit their 
, own data. 

One important aspect in cognitive radio networks that has 
received wide attention in research is to find the optimal 
■ sensing and transmission strategies for the secondary user 
This includes, inter alia, the determination of the optimal 
inter-sensing time for unslotted primary networks [IJ, [2J, 
specifying the optimal channel order for sensing and access 
fTl, [4 1, and finding the optimal sensing duration based on 
secondary observations 15|. Note that the detection of primary 
users becomes more reliable as the sensing duration increases. 
On the other hand, and under the assumption that a secondary 
user either senses or transmits over a channel, a long sensing 
duration means decreasing the time available for transmission. 

^This work was supported in part by a grant from the Egyptian NTRA 
(National Telecommunications Regulatory Authority). 



Sensing constitutes an overhead that comes at the expense of 
transmission [6]. This is true in both slotted and unslotted 
primary systems |5| indicating the existence of a tradeoff 
between sensing reliability and secondary throughput Q. 

The authors of fS | develop their preliminary work in f9\ and 
design a secondary access scheme that optimizes the secondary 
access efficiency while protecting the primary transmission 
from interference. The primary mode of operation is un- 
slotted, which means that its active and idle times are random 
variables. A utility function is developed to account for the 
secondary throughput and to penalize it for colliding with 
primary transmission. During the idle primary period, the 
secondary can either sense or transmit. Both the sensing and 
transmission durations are assumed to be fixed. The optimal 
solution is threshold-based such that the secondary transmits 
when its belief about the primary being idle exceeds a certain 
threshold. The belief is updated based on the secondary 
sensing observations and the feedback it receives from its 
respective receiver. The authors assume that the secondary user 
can perfectly detect the start point of the primary off duration. 
The problem of the quickest detection of transmission oppor- 
tunity is addressed in IfTOl and IfTTI . and is beyond the scope 
of this work. 

We build on the work in [8| and make the following 
contributions. Instead of using fixed sensing and transmis- 
sion durations for the secondary users, we consider varying 
the durations according to the belief of the secondary user 
concerning the primary activity. The durations become opti- 
mization variables that parameterize secondary utility function. 
The motivation for this is that the secondary transmitter may 
waste time and energy in long sensing periods although it has 
a high belief that the primary user is idle. On the other hand, 
the secondary transmitter may relatively increase its sensing 
period to detect the actual state of the primary user if it has 
a considerable belief that the primary user is busy. This long 
sensing duration makes the sensing outcome more reliable and 
reduces the probability of collision with the primary. There- 
fore, adaptive sensing and transmission durations can enhance 
the secondary throughput and afford more protection to the 
primary compared to the case of fixed durations. In addition, 
previous works also consider the secondary throughout on the 
basis of the whole transmission duration. This ignores the data 
overhead which makes it better for the secondary user to make 
one relatively long transmission instead of making multiple 



small transmissions. In a cognitive setting, however, a long 
transmission duration increases the probability of colliding 
with the primary user. Thus there is another tradeoff here if the 
overhead is accounted for as we do in this paper. We also allow 
the secondary to remain idle because in practice both sensing 
and transmission have a cost in terms of power consumption. 

The rest of the paper is organized as follows. The system 
model and problem formulation are described in Section |lll 
We present the adaptive sensing and transmission scheme in 
Section |III] In Section |IV] we provide simulation results and 
compare between the non-adaptive and adaptive schemes. We 
conclude the paper in Section |V] 

II. System Model and Problem Formulation 

We consider a channel allocated to a primary Tx-Rx pair 
which operates in an unslotted manner, switching activity at 
random times. A secondary terminal attempts to opportunis- 
tically access this channel maximizing its throughput while 
simultaneously minimizing the probability of colliding with 
primary transmission. 

The primary user's activity follows an alternating on/off 
renewal process with certain probability distributions for the 
idle and busy periods: /off (t) and /on (t) with means Toff and 
Ton, respectively. The idle and busy periods are independent of 
each other We assume that there is no cooperation between the 
primary and secondary users. The secondary user can quickly 
and reliably detect the transition of the primary user from busy 
to idle. This transition represents t = 0. 

A. Secondary Actions 

We assume that the secondary user always has data to 
transmit. Its objective is to enhance its transmission throughput 
while protecting the primary user from interference. The 
secondary transmitter can perform one of three actions: stay 
idle {/}, carry out spectrum sensing {S}, or transmit its 
data {T} with action space A — {I, S, T}. Let us define 
the immediate and expected future reward that the secondary 
user gains from taking action i as (M for myopic) and 
i?f (L for long term), respectively, where i e {/, S, T}. Let 
p (t) denote the secondary belief that the primary user is idle, 
p{t) G [0,1]. After performing an action i and obtaining a 
corresponding observation O, the updated belief is £^ {p). The 
observations are the sensing outcome or the acknowledgment 
received from the secondary receiver in case of transmission. 
If there is no observation associated with the action such as 
when the secondary user remains idle, the update is £i {p). The 
probability of observing O associated with action i is given 
by wf . 

The secondary utility function Us {p (t) , t) is given by 

Us{p{t),t)=m&,,{I{p{t),t),S{p{t),t),T{p{t),t)} 

(1) 

where I{p{t),t), S{p{t),t) and T{p{t),t) are the sec- 
ondary user's maximum expected utilities for taking the action 
of staying idle, carry out spectrum sensing or make data 
transmission respectively. Next we discuss how to formulate 
these three utilities depending on the secondary user action. 



1) First action "stay idle": Although we assume that the 
secondary user always has data to transmit, staying idle is 
sometimes the optimal action. It is better for the secondary 
transmitter, if the primary user is highly likely to be busy, to 
stay idle and conserve its energy than to consume its energy in 
sensing because it is more likely that the sensing outcome for 
the next few actions would be busy. This assumption is based 
on our primary traffic model which makes it more likely to 
sense the channel in the same state as the inter-sensing time 
diminishes |[l]. When the idle action is chosen, the secondary 
transmitter conserves its energy but on the other hand this 
causes a reduction in its throughput. We define the following 
terms: 

Tj: Time of staying idle 

Kj: Cost of staying idle per unit time 

For simplification we assume the time unit =1. 

Therefore the cost of staying idle for Tj units is 

Ci (Ti) = KiTi (2) 

Parameter Kj has units of rate. This is because, as in (W\, the 
utility function is mainly based on the secondary throughput. 
Cost Kj is defined as the secondary rate minus the energy 
saved per unit time expressed in terms of rate. The immediate 
reward of the secondary user is given by 

i?f - -Ci (Ti) (3) 

Since the action of the secondary user is to stay idle, there 
are no observations after the time T/. The belief is updated as 
follows 

£i (p) =p{t + Ti)=p (i) Poo (Ti) + {1-p (t)) P,o (Ti) 

(4) 

where Poo (t) is the probability of the channel to be idle at 
time instant t + t' if it is idle at time t', whereas Pio (t) is the 
probability of the channel to be idle at time instant t + t' if 
it is busy at time t'. Probabilities Poo (t) and Pio (t) depend 
on the on and off distributions and are provided in [1]. See 
the Appendix for the derivations of Poo {t) and Pio (t) for the 
uniform distribution as we use it in our numerical simulations. 

Therefore the expected future reward that the secondary user 
gains after staying idle for time T/ can be written as: 

Rf = Us{£i{p),t + Ti) (5) 

We can then write the secondary user maximum expected 
utility / {p, t) for taking the action of staying idle as: 

/(pW,t)-Pf +/3Pf (6) 

where /3 G [0, 1] is the discounting factor At /3 = 0, the 
secondary user only care for the immediate reward and does 
not take the future into account. The value of /3 is usually very 
close to one. 

2) Second action " spectrum sensing" : The secondary user 
senses the spectrum to detect spectral vacancies. Sensing has 
its cost expended to detect the presence of a signal and to 
acquire a sufficient number of samples to yield reliable results. 
We define the following parameters that identify the sensing 



cost: 

Ts'. Sensing time 

Ks'. Sensing cost/time unit 

The sensing cost for Ts units as a function of the sensing time 
is as follows: 

Cs {Ts) = KsTs (7) 



The immediate reward is then 



Rl^^-CsiTs) 



(8) 



The outcome of the sensing process is either free {O = F} or 
busy {O = B}. Spectrum sensing introduces false alarms and 
mis-detections, which are decreasing functions of the sensing 
time Ts- For a target detection probability Pd , the probability 
of false alarm is related to the target detection probability as 
follows Q: 



(9) 



where Q {.) is the complementary distribution function of 
the standard Gaussian and fs is the sampling frequency. The 
number of samples used for detecting the primary activity is 
fs Ts and ip is the received signal-to-noise ratio (SNR). 

We introduce now a quantity that is important in the 
construction of the secondary utility function. This quantity 
is the probability that the primary user remains idle during 
the secondary user action given that the primary user is idle 
Is]. We denote this conditional probability as qi (t), where i 
is a possible secondary action at time t. Let X be a random 
variable describing the duration over which the primary user 
remains inactive. The probability qi (t) is given by 



q,{t) = Pr{X >t + T,\X >t} 
Pr{X >t + T,} 



Pr{X > t} 

l-Fx{t + Ti_ 



(10) 
(11) 

(12) 



1 - Fx [t] 

where Fx (•) is the cumulative distribution function of the 
primary user idle time, and Ti is the time duration of the 
secondary's ith action. The characteristics of the qi (t) function 
vary according to the distribution of the idle period of the 
primary user. The function qi {t) is a decreasing function 
of time for many distributions such as uniform distribution, 
Gaussian distribution and Rayleigh distribution because as 
time increases the probability that the primary user return 
increases. For an exponential distribution, qi is constant due 
to the memoryless property. 

In case the secondary user decides to carry out spectrum 
sensing, the probability that the sensing outcome is free is 

w§^p (t) qs {t) (1 - Pfa {Ts)) + {l-p {t) qs {t)) (1 - Pd) 

(13) 

The assumption underlying this formula is that the sensing 
outcome is free only when the primary remains idle over all the 
sensing duration. This is a valid assumption given the traffic 
model so long as the sensing duration is small compared to 



the mean on/off durations. Toff and To,,. The belief update can 
be written as follows using Bayes rule. 



£i {p)=p{t + Ts) = 



p{t)qs {t){l-Pfa {Ts)) 



wi 



(14) 



Similarly, if spectrum sensing is carried out, the probability of 
a busy outcome is 

^P {t) qs {t) Pfa {Ts) + {1-P {t) qs {t)) Pd (15) 
The belief update is consequently given by 

£|(p)=Wt + T.) = ^ii^)M)^£^iM (16) 

The expected future reward when spectrum sensing is carried 
out 

R^s = w^Us {p),t + Ts) + w^Us {£§ {p),t + Ts) 

(17) 

And hence we can write the maximum expected utility S {p, t) 
that the secondary user gains from sensing the spectrum as: 

S{p{t),t)^Rl' +PR^ (18) 

3) Third action "data transmission": The immediate re- 
ward for the secondary transmitter after transmitting its 
date differs from the previous actions as there will be a reward 
for successful transmissions (increment in the secondary user 
throughput) and a collision cost for colliding with the primary 
user ( penalizing the interference to the primary user). Define 
the following terms: 

R: Reward/ time unit for successful transmission 

a: Overhead time 

Cc- Collision Cost/ time unit 

Tt'- Transmission time 

Kt- Transmission cost/ time unit 

The transmission energy can be written as: 



Ct {Tt) — Kj^Ft 



(19) 



The collision cost Cc can be written as a function of a factor 
7 which is controlled by the primary user to prevent the 
secondary user from transmitting a lot without caring for the 
primary user, where 76 [0, 1]. When 7 = 0, the primary user 
is afforded maximum protection. 



Cn — Ci 



Cmax 



(1-7) 



(20) 



The secondary receiver sends an acknowledgment to the 
secondary transmitter upon the processing of the received 
packet. An ACK is sent for correct decoding, while a NACK 
means that the receiver has failed in decoding the transmitted 
message. Thus the observations when the secondary terminal 
transmits are {O = A] or {O = N}. Note that receiving a 
NACK from the secondary receiver does not mean that a 
collision with the primary user has occurred, because receiving 
a NACK may result, for instance, from deep channel fades 
between the secondary transmitter and secondary receiver. On 
the other hand it is possible that the secondary receiver can 
successfully decode the secondary transmitter message even 
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Adaptive Sensing-Transmission Structure 



Fig. 1. Traditional vs. Adaptive Sensing-Transmission Structures 



when the primary user is transmitting concurrently. Define 
the following probabilities: 

Pnc- probability that the secondary transmitter receives a 
NACK although no collision with the primary has occurred. 
Pc'. probability that the secondary transmitter receives a 
NACK given that a collision happened with the primary user 

The immediate expected reward that the secondary user gains 
after data transmission is: 

R¥ = [pit) qrit) (1 - Pnc) + (1 - p{t) grit)) (1 - Pc)] 
R{TT-a)-{l- p{t) qrit)) CcTt - Ct {Tt) 

(21) 

The probability that the secondary user receives an ACK is 

= p{t) qrit) (1 - Pnc) + (1 - pit) grit)) (1 - Pc) 

(22) 

The probability that the primary user is idle after Tt given 
that the secondary transmitter receives an ACK is: 

p{t) qrit) (l- Pnc) 



8^{p)=p{t + TT) 



(23) 



Similarly, the probability that the secondary user receives a 
NACK is 



= p{t) grit) Pnc + (1 - p(<) Qrit)) Pc (24) 



The update in the NACK case is 



£^{p)=p{t + TT) = 



p{t)qT{t)PNC 



(25) 



The expected future reward is 

= Us {S^ {p),t + Tt) + Us {S^ ip),t + Tt) 



Finally, 



(26) 
(27) 



T{p{t),t) = +/3i?^ 

III. Adaptive Sensing and Transmission 

Many existing works on cognitive radio networks assume 
that the secondary users have fixed sensing and transmission 
durations as illustrated in Fig.[T] This is not necessarily optimal 
as the secondary transmitter should not waste time and energy 
in long sensing periods although it has a high belief that the 



primary user is idle. This time can be exploited in relatively 
long transmissions, thereby increasing secondary throughput 
and maximizing its utility. On the other hand, at low values 
of p the secondary transmitter can increase its sensing time 
to get reliable results on the primary user state as increasing 
the sensing time decreases the false alarm probability and the 
probability of mis-detection. In this case, it can also decrease 
its transmission duration to reduce the probability of collision 
with the primary user. 

The secondary transmitter can adaptively vary its sensing 
and transmission durations according to its belief about the 
primary user state p{t). In the sequel, we fix Tj and assume 
that the sensing and transmission times are linear functions of 
p. That is, 

Tt (p) ao + aip (28) 



Ts (p) =bo- bip 



(29) 



where oq, ai, bo and bi are our design parameters. Note 
that if a fixed Tt or Ts is optimal, then the solution of the 
optimization problem would yield close-to-zero ai or bi. 

Parameters uq, ai, bo and 61 are nonnegative and obey the 
following inequalities: 



0-0 > Tt, min 

bo — bi > Ts^min 

ao + o-i < Tt ,max 

^0 < T's,max 



(30) 



where the positive parameters TT^min , 7s,min , Tr.max and 
Ts.max are the minimum and maximum transmission and 
sensing durations, respectively. The minimum duration for 
transmission is dictated by data overhead and the shortest 
possible data payload, whereas the minimum time for sensing 
is dictated by a minimal detection reliability requirement. 
The maximum durations are imposed to protect the primary 
by frequently checking its activity. Moreover, the maximum 
durations should be considerably less than Toff and Ton in 
order for the probability formulas to be valid. The first two 
constraints in (l30l l maintain that the sensing and transmission 
durations are not less than the minimum specified values for 
all e [0,1]. 

Our objective is to obtain the optimal sensing and trans- 
mission durations to maximize the secondary user utility. In 
other words, our objective is to dynamically decide, for each 
secondary user action, the optimal sensing and transmission 
durations to maximize the secondary user utility. Define 



S* [p [t) , t) = max S {p (t) , t) 

ao,ai,ba,bi 

T* {p (t) , t) = max T {p (t) , t) 

a.a,ai ,ba,bi 



(31) 



In the adaptive case, the secondary utility then becomes 

Usipit),t) = 

max {/ {p (t) ,t),S* {p it) ,t),T* (p (t) , t)} 
The optimal parameters if the secondary action is to sense or 



(32) 




Fig. 2. Secondary user utility Us{p,t) as a function of p at 4 = 200. 
Tlie tliree components of tlie utility function, I{p, t), S{p, t), and T(p, t) are 
depicted. The figure shows the threshold-based nature of the optimal policy. 

to transmit are Aq, a*, fe^j &i such that 

ao,al,bQ,bl= argmax S {p {t) , t) 

a.a,ai.bQ,bi 

if S* (p (t) , t) > max {/ {p (t) ,t),T* (p (t) , t)} 
or , a J , 6q , 6^ = argmax T {p{t) ,t) 

if T* {p it) , t) > max {/ (p (t) ,t),S* {p (t) , t)} 

The optimal action for the secondary user can be found 
either by using value iteration or backward induction. Em- 
ploying value iteration with /3 < 1, we initialize the matrix 
Us {p (t) , t) with zeros. We iterate using (l32T i until conver- 
gence 1 12|. Hence we obtain the optimal action for each p and 
t. Another method which is used in this paper for the numerical 
results is to use backward induction with (3 = 1. The possi- 
bility of doing backward induction with a unity discounting 
factor is predicated on the monotonically decreasing nature of 
the function qi (t) given by (fT2] i for some distributions. That is, 
as time proceeds, the probability of the primary user remaining 
idle during the sensing or transmission phase approaches zero. 
This means that regardless of the value of p, at large times, 
the secondary utility function is given by 

Us (p, t) = max { i?f , R^^ , R^' } (33) 

for large t such that qi (t) is almost zero. Given these values, 
backward induction can be used to get all Us values at different 
p and t. 

As shown in the next section, our solution is a threshold- 
based policy as in |8|. This means that the secondary transmits 
when its belief about the primary being idle exceeds a certain 
threshold. 

IV. Simulation Results 

The simulation results section has three main parts. First we 
show the relation between the secondary user utility Us {p, t) 
and the belief state p at certain time t. In the second part, we 
provide simulation results for the traditional scheme where 
the sensing and transmission durations for the secondary 



Fig. 3. Secondary utility Us (p, t) as a function of 7 with Tt = 10, Tj = 5, 
and various values for Tg. The case here is non-adaptive transmission and 
sensing durations with perfect sensing and no data overhead. Utility Us (p, t) 
increases with 7 as the collision penalty decreases. A higher sensing duration 
is a waste of transmission opportunities as sensing is assumed to be perfect. 
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Fig. 4. Secondary utility Us (p, t) as a function of 7 with Tg = 1, 
Tj = 5, and various values for Tj'. The case here is non-adaptive transmission 
and sensing durations with perfect sensing and no data overhead. Utihty 
Us (p, t) increases with 7 as the collision penalty decreases. A higher 
transmission duration reduces the secondary utility due to the increase in 
collision probabihty. Hence, the reduction in Us (p, t) caused by a high Tx 
decreases as 7 increases. 



transmitter are fixed. Finally, we simulate our adaptive scheme 
and compare it with the fixed one. In the second and third part 
we study perfect/imperfect sensing with/without the overhead. 
The results demonstrate the performance enhancement due to 
adapting the sensing and transmission durations. 

For the results below in the three parts we use the following 
simulation parameters unless otherwise mentioned. For the idle 
action, the secondary user idle duration is T/ = 5 , cost 
of staying idle per time unit Kj = 0.001. For the sensing 
action, we use the sensing cost per time unit Ks = 0.1. The 
transmission action parameters are as follows: transmission 
cost per time unit Kt = 0.1, overhead time a = 1, reward 
per time unit for successful transmission R = 1. As mentioned 
before, < 7 < 1 is a factor controlled by the primary user 
to control the secondary user transmissions by changing the 
collision cost. At 7 = 0, a maximum protection is required 



0.45 




Y 

Fig. 5. Secondary utility versus 7 with Tg = 1, T/ = 5, and various 
Tj' values under the non-adaptive scheme assuming perfect sensing, but 
considering data overhead. At small 7 values, long transmission durations 
reduce utility due to collision penalty. As 7 increases, the collision penalty 
decreases and short transmission durations result in a low utility due to the 
overhead. 
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Fig. 6. This figure shows the same situation as Fig. |3] but allowing for 
sensing errors and assuming a data overhead. Note that the best performance 
corresponds to the intermediate value for Tg. The reason is that although 
increasing the sensing duration comes at the expense of transmission duration, 
it increases decision rehability and reduces lost transmission opportunities 
caused by false alarm. 

which is equivalent to Cc — Ccmax — 20, the other extreme 
case happened at 7 — 1 which gives Cc — 0. we assume 
that Pnc — and Pc — 1 for all simulation parts. For the 
primary user, /on (t) and /off (t) are uniform over the interval 
[0, 1000]. We use also backward induction where /? = 1. 

A. Secondary user utility function characteristics 

Fig. 12] shows the relation between the secondary user utility 
Us {p, t — 200) and the belief state p in the perfect sensing 
with overhead case. Note that Us {p, t — 200) is a convex 
and increasing function in p. It can be shown following an 
argument similar to that in fFj that Us {p, t) is a convex and 
increasing function in p at any value of t. For this figure, we 
use a fixed sensing time Tg — 20 and a fixed transmission 
time Tt = 7. For the collision cost we set 7 ~ 0.5. 

The threshold based structure is obvious in Fig. |2] where 
pI (t = 200) = 0.3939 andp* {t = 200) = 0.9522. Atp < p*. 



/ {p, t) is greater than S {p, t) and T (p, t) which means that 
the optimal action for the secondary user is to stay idle. At 
Pi<P<P2^S [p, t) is greater than / (p, t) and T {p, t) which 
means that the optimal action for the secondary user is to sense 
the spectrum. At p > P2, the optimal action is to transmit. 

B. Traditional scheme 

Here we consider the case where the secondary transmitter 
has a fixed sensing and transmission durations for all its access 
period. We show the variation of the secondary user utiUty with 
7 at different values for the fixed sensing and transmission 
durations. In this part we consider perfect/imperfect sensing 
with/without the overhead. 

1) Perfect sensing without overhead: The change of the 
secondary user utility with 7 at different sensing durations 
is shown in Fig. |3] We fix the transmission duration at 
Tt = 10. Increasing the sensing duration for the secondary 
transmitter decreases its utility since we assume in this part 
that the secondary user has a perfect sensing mechanism. 
That is, the optimal sensing time for the secondary user is 
Ts = Tsmin = 1- The secondary user obtains no gain from 
increasing the sensing duration as Pfa = and Pd = 1 and, 
in fact, sensing wastes time that can potentially be used for 
data transmission. We notice from the figure that the secondary 
user utility Us {p, t) increases with 7, because as 7 increases, 
the collision cost decreases and, hence, the utility increases. 

The effect of varying the transmission duration on the Us 
versus 7 curve at a fixed sensing time T5 = 1 is shown in 
Fig.|4] When a = 0, the secondary user utility decreases as the 
transmission duration increases at fixed sensing duration. This 
decrease is reduced as 7 increases. The reason for this is that 
a longer transmission duration means a higher probability of 
colliding with primary transmission. Since when 7 increases, 
the collision penalty decreases, the degradation caused by a 
long transmission duration is reduced. 

2) Perfect sensing with overhead: We now study the sec- 
ondary utility considering the data overhead. The case cor- 
responding to Fig. |3] with overhead is not provided here as 
the utility is reduced at all 7 due to overhead. However the 
effect of varying the transmission duration at a fixed sensing 
time Ts — I produces an interesting change as shown in 
Fig. |5] When the collision penalty is low at high 7's, small 
transmission durations barely above the data overhead result in 
small secondary utility relative to high transmission durations. 
On the other hand, at high collision costs, long transmission 
durations reduce the secondary utility. 

3) Imperfect sensing with overhead: Recall that the false 
alarm probability varies with the sensing time according to (|9|l- 
We use the following parameters in the imperfect sensing part: 
detection probability Pd — 0.9, B.W. — 50KHz, sampling 
frequency fs = 5/8B.W., ^ = -2bdB and Ks = 0.01. 
Fig. |6] shows the performance of the secondary user utility 
while varying the sensing duration at fixed transmission time 
Tt = 10. The case here is different from Fig.|3] As previously 
mentioned, under perfect sensing, the optimal sensing duration 
is the minimum possible. Increasing the sensing duration 




Fig. 7. Comparing tlie adaptive sclieme witli tlie traditional one at fixed 
Tj = 5 and varying Tt and Tg , assuming perfect sensing witli data overhead. 
Note that the improvement in the secondary utility is due to the adaptive 
transmission duration not the varying sensing duration. 




Fig. 8. Comparing the adaptive scheme with the traditional one at fixed 
Tj = 5 and varying Tt and Ts under perfect sensing without data overhead. 
In this case only, the adaptive structure converges to the non-adaptive one 
where the optimal sensing and transmission durations are the fixed ones. 




Fig. 9. Comparing the adaptive scheme with the traditional one at fixed 
Tj = 5 and varying Tt and Tg. Sensing is imperfect with overhead. Note 
that in this case, the optimal durations are the adaptive ones. 



brings no additional benefit or refinement of the sensing 
outcome. In the imperfect sensing case, there is a tradeoff 
between increasing the sensing time and consequently getting 
a lower probability of false alarm, and decreasing it to have 
more time for transmission. 

This tradeoff is well demonstrated in Fig. |6] As the sensing 
duration is increased, we get a higher utility due to the lower 
false alarm probability. However an excessive increase in the 
sensing duration degrades the performance because little time 
is left for transmission. 

The effect of varying the transmission duration for the sec- 
ondary user while fixing the sensing duration is not depicted 
here as it is similar to Fig. |5] with a degradation of utility at 
all collision costs due to imperfect sensing. Simulations for 
the imperfect sensing without the overhead are omitted. 

C. Adaptive scheme 

In this part we compare our adaptive sensing and trans- 
mission scheme with the traditional fixed one. We show that 
using adaptive durations for sensing and transmission returns 
a higher utility for the secondary user We set TT^min = 1, 



Tt 



30, Ts. 



1, Ts. 



10. 



1 ) Perfect sensing with overhead: The effect of varying the 
sensing and transmission durations for the secondary user at 
each value of 7 is shown in Fig. |7] where we fix the sensing 
duration in the traditional structure at = 1 and simulate the 
system at different transmission durations. We notice that at 
every value of 7 which corresponds to a certain collision cost, 
the adaptive structure returns a higher utility for the secondary 
user than using fixed durations. However solving our constraint 
optimization problem in the perfect sensing case gives us a 
fixed sensing duration and an adaptive transmission one. That 
is the optimizer returns bQ = l and bl = for all values of 7 
which gives a sensing duration as: 



Ts = b*+ blp = 1 



(34) 



This is what we expect for perfect sensing case. 

Regarding transmission durations, solving the optimization 
problem returns different values for Qq and for every value 
of 7. The higher utility in Fig. Q is due to the adaptive 
transmission duration at every value of 7, not the varying 
sensing durations. This is not the case under imperfect sensing 
where the optimizer chooses 6J 0. 

2) Perfect sensing without overhead: Here &q = 1 and = 
for all values of 7 which gives a sensing duration as in 
equation (|34] |. Also for the transmission time, the optimizer 
returns Qq = 1 and — for all values of 7 which gives the 
minimum transmission duration as expected due to the absence 
of the overhead. 



Tt = Oq + alp = 1 



(35) 



Actually in this case only, the adaptive structure converges to 
the fixed one as shown in Fig. [8] i.e. the secondary transmitter 
uses fixed sensing and transmission durations in order to 
maximize its utility. 



3) Imperfect sensing with overhead: To test the impact of 
adapting both the sensing and transmission durations for the 
secondary user we consider in this part the imperfect sensing 
with overhead case. Solving our optimization problem, we 
found that both the sensing and the transmission durations 
are varying adaptively according to the belief that the primary 
user is idle and that aj, aj, &q and bl will all have values that 
do not equal to zero at all values of 7. Fig. |9] shows that it 
is better for the secondary transmitter to adaptively change its 
sensing and transmission durations according to the belief p 
as that increases secondary utility. We notice that at 7 = 1, 
the optimizer chooses the maximum transmission time and the 
minimum sensing time as follows because at this value of 7 
there is no collision penalty. 



Tt = Tt„ 
Ts = Tsn 



flp + a^p 

b* - blp 



(36) 
(37) 



V. Conclusion 



We have developed an adaptive scheme for the sensing 
and transmission durations of a secondary user sharing a 
channel with a primary user. The sensing and transmission 
durations are varied adaptively according to the secondary 
belief regarding primary activity. The objective is to maximize 
the secondary utility which takes into account the impact of 
the secondary user's decision on the future. Simulation results 
have demonstrated that the proposed adaptive scheme returns 
a higher utility than the non-adaptive one. 

Several interesting directions for future work exist. For 
example, a power control scheme can be incorporated so 
that the secondary adapts its transmission power based on 
its belief regarding the primary state of activity and also the 
channels connecting the primary and secondary transmitters 
and receivers. Furthermore we can consider the case where 
there is a kind of cooperation between primary and secondary 
users. The incentive for the primary user would be some extra 
revenue or some help from the secondary user in relaying its 
message. Finally, the investigation can be made more realistic 
by incorporating the primary and secondary queues in the 
analysis. 

Appendix 

A. Derivation of Pqq (t) and Pio (t) 

Probabilities Pqo (t) and Pio (t) depend on the on and off 
distributions of the primary user. Using renewal theory, Pn (t) 
can be expressed as: 



^^^du+ f hw{u)fonit-u) du (38) 



where hiQ iu) is the renewal density of the off state given that 
the renewal process started from the on state. It is proven in 
lfT3l that hiQ (s) is given by: 



/iio (s) 



/off {s) (1 - /on (5)) 
^onS (1 - /on (s) /off (s)) 



(39) 



By applying Laplace transform to equation (I38|) . we get 

p . , 1 (l-/on(g))(l-/off(5)) 

Pii (s) = - - - — YT\ f I \ f — TTT ^^^^ 

S TonS^ (1 - /on (s) /off (Sj) 

Our objective now is to derive the formulas of Pio [t] and 
Poo [t)- Using the inverse Laplace transform of equation (l40l) . 
we can get Pio [t) as: 



(41) 



Pio (t) = 1 - Pii {t) 

Similarly, using the inverse Laplace transform of equation (|42)) 

we can get Pqo [t) as 

p . ^ 1 (1 - /off {S)) (1 - /on {S)) .... 

Poo s = - - jj- — , , (42) 

S PoffS^ (1 - /off (s) /on (Sj) 

We now focus on the case when /on (t) and fog (t) are 
uniformly distributed on the interval [0, b] 

Ton — Toff — - 
fan (t) = /off (t) = j[u{t)^uit~ b)] (43) 

Using Laplace transform we can get: 

/ (s) = /on (s) = /off (s) = ^ [1 - exp i-bs)] (44) 



Pii (s) 



1 



(i-/Wf 



S Tons'' (1 - P (S)) 
1 (l-/(.^)) 



S Tons' (1 + / (s)) 

_ 1 2 1 

^ 1 ' TonSHl + f (s)) ~ Tons' 

1 4 1 



+ x + x' + + 



s s(6s + 1 -exp(-5s)) (6/2) s2 

Using the Maclaurin series for = 1 
where x — 'iT'"''' we get the followin 



(45) 



4 cxp( — 6s) 
(6/2)s^ ^ ^s(l+bs) s(l+bs)2 

+ ...) 



4oxp( — 26s) , 4oxp( — 36s) 
s(l+6s)^ ' s(l+bs)^ 



- (.9o(s) +5i(s)exp(-6s) 



s (6/2)s2 ' ^ . : —r- x , ^^^^ 

+ 32(5) exp(-26s) + 33(5) exp("36s) + ...) 

After some algebraic computations we get the following: 

Pii(t) ^ u{t)-'^u{t)-{g^[t)+g^{t-b)u{t-b) 

+ gi{t - 2b)u{t - 2b) + g3(t - 36)M(t - 35) + ...) 

(47) 

since 

1 < < < 5 

Therefore, we get only the first term of the series go(t) 

Pii(i) = uit) ~ -u(t) - (4 - 4exp(-i/6)) (48) 




Pio (t) = 1 - Pii (t) 



then at t = Ti we get the following: 

PwiTi) = (5 - 4exp(-T//6)) - u{Ti) + ^u{Ti) (49) 

Following the same argument for Poo(s)> we can get Poo{Tj) 
as follows: 

2Tt 

Poo{Ti) = u{Tj) - -j-^Ti) - (4 - 4exp(-T7/&)) (50) 
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